Goto

Collaborating Authors

 customer service chatbot


Experimental Evaluation of Machine Learning Models for Goal-oriented Customer Service Chatbot with Pipeline Architecture

Isa, Nurul Ain Nabilah Mohd, Jawaddi, Siti Nuraishah Agos, Ismail, Azlan

arXiv.org Artificial Intelligence

Integrating machine learning (ML) into customer service chatbots enhances their ability to understand and respond to user queries, ultimately improving service performance. However, they may appear artificial to some users and affecting customer experience. Hence, meticulous evaluation of ML models for each pipeline component is crucial for optimizing performance, though differences in functionalities can lead to unfair comparisons. In this paper, we present a tailored experimental evaluation approach for goal-oriented customer service chatbots with pipeline architecture, focusing on three key components: Natural Language Understanding (NLU), dialogue management (DM), and Natural Language Generation (NLG). Our methodology emphasizes individual assessment to determine optimal ML models. Specifically, we focus on optimizing hyperparameters and evaluating candidate models for NLU (utilizing BERT and LSTM), DM (employing DQN and DDQN), and NLG (leveraging GPT-2 and DialoGPT). The results show that for the NLU component, BERT excelled in intent detection whereas LSTM was superior for slot filling. For the DM component, the DDQN model outperformed DQN by achieving fewer turns, higher rewards, as well as greater success rates. For NLG, the large language model GPT-2 surpassed DialoGPT in BLEU, METEOR, and ROUGE metrics. These findings aim to provide a benchmark for future research in developing and optimizing customer service chatbots, offering valuable insights into model performance and optimal hyperparameters.


Simulating Field Experiments with Large Language Models

Chen, Yaoyu, Hu, Yuheng, Lu, Yingda

arXiv.org Artificial Intelligence

Prevailing large language models (LLMs) are capable of human responses simulation through its unprecedented content generation and reasoning abilities. However, it is not clear whether and how to leverage LLMs to simulate field experiments. In this paper, we propose and evaluate two prompting strategies: the observer mode that allows a direct prediction on main conclusions and the participant mode that simulates distributions of responses from participants. Using this approach, we examine fifteen well cited field experimental papers published in INFORMS and MISQ, finding encouraging alignments between simulated experimental results and the actual results in certain scenarios. We further identify topics of which LLMs underperform, including gender difference and social norms related research. Additionally, the automatic and standardized workflow proposed in this paper enables the possibility of a large-scale screening of more papers with field experiments. This paper pioneers the utilization of large language models (LLMs) for simulating field experiments, presenting a significant extension to previous work which focused solely on lab environments. By introducing two novel prompting strategies, observer and participant modes, we demonstrate the ability of LLMs to both predict outcomes and replicate participant responses within complex field settings. Our findings indicate a promising alignment with actual experimental results in certain scenarios, achieving a stimulation accuracy of 66% in observer mode. This study expands the scope of potential applications for LLMs and illustrates their utility in assisting researchers prior to engaging in expensive field experiments. Moreover, it sheds light on the boundaries of LLMs when used in simulating field experiments, serving as a cautionary note for researchers considering the integration of LLMs into their experimental toolkit.


Towards AI-Safety-by-Design: A Taxonomy of Runtime Guardrails in Foundation Model based Systems

Shamsujjoha, Md, Lu, Qinghua, Zhao, Dehai, Zhu, Liming

arXiv.org Artificial Intelligence

The rapid advancement and widespread deployment of foundation model (FM) based systems have revolutionized numerous applications across various domains. However, the fast-growing capabilities and autonomy have also raised significant concerns about responsible AI and AI safety. Recently, there have been increasing attention toward implementing guardrails to ensure the runtime behavior of FM-based systems is safe and responsible. Given the early stage of FMs and their applications (such as agents), the design of guardrails have not yet been systematically studied. It remains underexplored which software qualities should be considered when designing guardrails and how these qualities can be ensured from a software architecture perspective. Therefore, in this paper, we present a taxonomy for guardrails to classify and compare the characteristics and design options of guardrails. Our taxonomy is organized into three main categories: the motivation behind adopting runtime guardrails, the quality attributes to consider, and the design options available. This taxonomy provides structured and concrete guidance for making architectural design decisions when designing guardrails and highlights trade-offs arising from the design decisions.


Delivery Firm's AI Chatbot Goes Rogue, Curses at Customer and Criticizes Company

TIME - Tech

An AI customer service chatbot for international delivery service DPD used profanity, told a joke, wrote poetry about how useless it was, and criticized the company as the "worst delivery firm in the world" after prompting by a frustrated customer. Ashley Beauchamp, a London-based pianist and conductor, according to his website, posted screenshots of the chat conversation to X on Thursday, the same day he said in a comment that the exchange occurred. At the time of publication, his post had gone viral with 1.3 million views, and over 20 thousand likes. The humorous exchange symbolizes bigger issues as artificial intelligence has infiltrated every area of life––from art to education to business––especially with the introduction of publicly available chatbot ChatGPT. Companies have turned to AI to streamline their work, amid an ongoing debate about how effective bots are in replacing humans or whether AI will eventually outsmart us.


Customer service chatbots: How to create and use them for social media

#artificialintelligence

Exceeding customer expectations isn't as easy as it used to be. High inbound message volumes and rising customer care standards have left support teams hustling to keep resolution times low. It's officially time to call in the bots. Customer service chatbots, that is. Don't panic--no robot can replace a diligent customer service professional.


ChatGPT -- The Beginning of a New Era

#artificialintelligence

Customer Service Chatbot: ChatGPT can be used to create a customer service chatbot that can provide customers with personalized answers to their questions and route them to the right resources. The chatbot can be trained to understand customer needs and provide timely responses to their queries. Online Shopping Assistant: ChatGPT can be used to create an online shopping assistant that can help customers find and purchase products from an online store. The chatbot can be trained to understand the customer's preferences and recommend products accordingly. Educational Chatbot: ChatGPT can be used to create an educational chatbot that can help students learn by providing personalized explanations to their queries.


Customer Service Chatbots Are Falling Short: Here's Why

#artificialintelligence

Let's face it: customer experience chatbots have been a major disappointment. They currently sit squarely at the bottom of Gartner's Hype Cycle for Natural Technologies, July 2021 ("As an innovation does not live up to its overinflated expectations, it rapidly becomes unfashionable and attention wanes."). GOMoxie found only 22% of consumers have a positive impression of chatbots. A Userliike survey found that the No. 1 thing that consumers want from chatbot experiences is the ability to escalate the interaction to a human. No wonder some research firms have called the current chatbot landscape a "failed revolution." Why, despite the millions upon millions of dollars of funding going into the space, have chatbots yet to provide anywhere near a level of experience warranted from such market hype?


Improving Customer Service Chatbots with Attention-based Transfer Learning

Bird, Jordan J.

arXiv.org Artificial Intelligence

With growing societal acceptance and increasing cost efficiency due to mass production, service robots are beginning to cross from the industrial to the social domain. Currently, customer service robots tend to be digital and emulate social interactions through on-screen text, but state-of-the-art research points towards physical robots soon providing customer service in person. This article explores two possibilities. Firstly, whether transfer learning can aid in the improvement of customer service chatbots between business domains. Secondly, the implementation of a framework for physical robots for in-person interaction. Modelled on social interaction with customer support Twitter accounts, transformer-based chatbot models are initially tasked to learn one domain from an initial random weight distribution. Given shared vocabulary, each model is then tasked with learning another domain by transferring knowledge from the prior. Following studies on 19 different businesses, results show that the majority of models are improved when transferring weights from at least one other domain, in particular those that are more data-scarce than others. General language transfer learning occurs, as well as higher-level transfer of similar domain knowledge in several cases. The chatbots are finally implemented on Temi and Pepper robots, with feasibility issues encountered and solutions are proposed to overcome them.


How Chatbots are Transforming Customer Service with AI

#artificialintelligence

The traditional way of customer service falls short when it comes to meeting the constantly evolving expectations of new-age customers. So, businesses that still adhere to the old ways must change the ways if they aim to deliver greater experiences to customers at every step of their journey. This is where AI chatbots make the foray into the customer service sphere as they not only impact the support but can also automate functions across sales, or marketing verticals as well. With 50% of consumers no longer caring whether they are dealing with humans or AI-enabled assistants, bots-driven automation can definitely fill the gap in the customer service hierarchy and ensure value. Using AI bots, it becomes easy to provide better prompt assistance at various touchpoints of the customer journey, streamline the processes and boost the level of enhancing customer engagement.


Anthony Bourdain's voice-cloning for new doc called into question: It's 'a slippery slope'

FOX News

Fox News Flash top entertainment and celebrity headlines are here. Check out what's clicking today in entertainment. The revelation that a documentary filmmaker used voice-cloning software to make the late chef Anthony Bourdain say words he never spoke has drawn criticism amid ethical concerns about use of the powerful technology. The movie "Roadrunner: A Film About Anthony Bourdain" appeared in cinemas Friday and mostly features real footage of the beloved celebrity chef and globe-trotting television host before he died in 2018. But its director, Morgan Neville, told The New Yorker that a snippet of dialogue was created using artificial intelligence technology.